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Abstract -Financial markets are well known examples of multi-fractal complex systems that 
have garnered much interest in their characterization through complex network theory. The 
recent studies have used correlation based distance metrics for defining and analyzing financial 
networks. In this work the singularity strength is employed to define a distance metric and the 
existence of hierarchical structure in the Johannesburg Stock Exchange is investigated. The multi- 
fractal nature of the financial market, which is otherwise hidden in the correlation coefficient based 
prescriptions, is analyzed through the use of the singularity strength based method. The presence 
of a super cluster is exhibited in the network which accounts for half of the network size and is 
homogeneous in the sectoral composition of the South African market. 
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Study and characterization of financial networks has 
sjeen a surge in recent years based on the pioneering work 
by Mantegna |T| . Financial networks have been considered 
iia the context of collective behavior [5] and asset trees [3] 
in the New York Stock Exchange and the Dow Jones In- 
dustrial Average [3]; hierarchical clustering in the Turk- 
ish market [5], the trading volume [6] and stock prices 
[7] in the Korean market. Several methods similar to 
Mantegna's prescription have also been proposed; like syn- 
chronization of chaotic maps [5] and statistical validation 
based on an unsupervised algorithm [3] to study financial 
networks. 

The construction of financial networks requires the iden- 
tification of a metric that provides the notion of distance 
between different entities in the network. The metric used 
in Rcf. [TJ 
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with Cij being the correlation coefficient is calculated be- 
tween the return time series of different equities i and j 
trading in the market. This method however does not 
account for the well known scaling and multi-fractal na- 
ture of financial markets [TUhT¥| . Hence, to further un- 
derstand the relation of the multi-fractality and network 
properties, it would be of considerable interest to employ 
a multi-fractality based measure of distance and explore 
the possibility of finding community structure in financial 
markets. 

In this work, we will use the singularity strength or 
the Holder exponent to define the "singularity metric" 
that provides the notion of distance between two multi- 
fractal time series. We calculate the this metric by the use 
of Wavelet Based Multi Fractal De-trended Fluctuation 
Analysis (WBMFDFA). This distance metric will then be 
used to probe the clustering properties of a financial net- 
work formed by the equities trading on the Johannesburg 
Stock Exchange (JSE), through single linkage hierarchical 
clustering to show the existence of one super-cluster which 
is homogeneous over the different sectors of the economy. 
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We verify these findings by studying the network proper- 
ties through the adjacency matrix as a function of a cut-off 
threshold defined on the basis of the singularity strength. 
We will also analyze the multi-fractal properties of the JSE 
by using the Hurst exponent which is related to the multi- 
fractal scaling exponent h(r) as H = h(r = 2). We find 
that the JSE fluctuations collectively show near-diffusive 
behavior corresponding to k~ 2 (inverse square law) with a 
few exceptions which notably are amongst the high capital 
industries in South Africa. 

Mandelbrot in his seminal work in 1963 |T31|TB] pro- 
posed the departure from Gaussianity of empirical price 
change distributions leading to exploration of stationary 
and non-stationary time series in various fields. The De- 
trended Fluctuation Analysis (DFA) proposed by Peng 
et al. in 1994 [T7l422| ) is based on the mono-fractal hy- 
pothesis of the time series where the single Hurst expo- 
nent is sufficient to explain the self similarity of the 
time series. Kantclhardt et al. subsequently proposed the 
generalized Multi Fractal De-trended Fluctuation Analysis 
(MFDFA) [TO] in 2002, which has found wide applications 
in understanding the scaling behavior of financial markets 
(for example see [TlTfl"4] ). Using the multi- resolution ca- 
pability of wavelets [MIUS], Manimaran et al. proposed 
the Wavelet Based MFDFA (WBMFDFA) gM28] in 2005 
which has been used to study the emergence of scaling and 
self-similar behavior in various financial markets [2"9"1. This 
method has also been applied to other areas for exam- 
ple, quark-hadron phase transition in plasma data [30] and 
cancer detection [3Tj . 

In one dimensional Discrete Wavelet Transform (DWT) 
24,32], a real valued function £(a?) can be decomposed as 
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where, 4>(x) and ^J(x) are square integrable functions 
forming the orthonormal basis for a L 2 (C) Hilbcrt space 
and are called the father and mother wavelets respec- 
tively. They are subject to the admissibility conditions: 
/ 4>{x)dx < oo, / ty{x)dx = 0, / 4>{x)*^{x)dx = and 
/ \4>{x)\ 2 dx = J \^(x)\ 2 dx = 1. The ipij(x) are called the 
daughter wavelets and are related to the mother wavelet 
ty(x) by scaling and translation (and in higher dimensional 
cases also rotation) by ipij(x) = 2*' 2 \D r (2 , a? — j), where, i 
and j are the scaling and translation parameters respec- 
tively. Thus, at the zth scale, the daughter wavelet 4>ij(x) 
is 2 % times the height and 2 1 / 2 the width of the mother 
wavelet ^(x). In cq. ©, Oj and bij are the low pass and 
high pass coefficients which respectively carry information 
about the low frequency behavior or "average behavior" 
and high frequency behavior or "detail behavior" of the 
function. In case of the Daubechies wavelets, the ty(x) is 
also made to satisfy the vanishing moment condition, that 
is, J ^{x^x^dx = 0,p < P; the analysis of the Pth order 
derivative of the function £(x) becomes possible. This is 
instrumental in the extraction of trends of various poly- 



nomial orders from the function. For example, since the 
second wavelet of the Daubechies family Db-4 has two van- 
ishing moments, it can be used to extract constant and 
monomial trends, while Db-6 with three vanishing mo- 
ments can extract constant, linear and quadratic trends. 
This property is particularly helpful in performing a fluc- 
tuation analysis where we are interested in specific scales 
and specific patterns of the function. 

In WBMFDFA, given a non-stationary time series x(t), 
we firstly make it stationary by evaluating the normalized 
log return scries [15] given by R(t) = r ^~^ t ^ | where 

(• • • ) represents the time average, r{t) = \ogx{t + 1) — 
\ogx{t) is the log returns and a r u) — y (r(t) 2 ) — (r(t)) 2 
is the volatility of the series. We then perform the fluc- 
tuation extraction on the profile calculated as Y(n) = 

n 

J2 R(t),n e [1,N - 1], where N is the length of the 

t=l 

return series. The fluctuations at every scale are ob- 
tained by reconstructing the trend from the low pass co- 
efficients obtained by performing a DWT with the Db- 
4 wavelet and then subtracting it from the Y(n). The 
details of the method have been discussed in [26H291I31] . 
Since the asymmetry of the Db-4 basis leads to convolu- 
tion errors and edge effects [24[|2"5] . we reverse the profile, 
perform the same fluctuation extraction on the reversed 
profile and take the average to avoid the artificial errors 
that may be produced and obtain the fluctuation fj at 
each scale j. These fluctuations are then segmented into 
M w = mt(N/w) non-overlapping segments, where w is 
the window size. The w is related to the scale j by the 
number of filter coefficients for the given wavelet. The 
resulting rth order fluctuation function F r (w) (r £ Z) is 
obtained [TO] as 



F r (w) = 




(3) 



It can be immediately observed that F r (w) diverges at 
r = and hence Fq(w) is calculated as 



Fq(w) = exp 



2M 



— ^ log [F 2 (m,w)] 



l/r 



r/2 



(4) 



For a self similar process, the fluctuation function F r (w) 
follows a scaling law, the scaling function h(r) is given 
by F r (s) - w h ^ which is related to the classical par- 
tition function based multi-fractal scaling exponent r(r) 
by r(r) = rh(r) - 1 [SUM]- This r(r) is related to the 
singularity spectrum /(/?) which represents the dimension 
of the subset of the time series, by a Legendre transform 
ft = d/dr[r(r)] where ft is called the singularity strength or 
the Holder exponent and f(ft) = rft — r(r) [10]. Thus, the 
singularity strength, singularity spectrum and the scaling 
function are related by f{ft) = r{ft — h(r)} + 1. 

The singularity strength converges to zero in the case 
of mono-fractals and increases with multi-fractality. We 
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must however, note that though the Hurst exponent de- 
fined as H = h(r = 2), is bounded in [0, 1]; the singularity 
strength only has a lower bound of zero for the mono- 
fractal case and higher the multi-fractality, more is the 
width of the singularity spectrum. This makes the width 
defined by 

7 = max(/3) — min(/3) (5) 

a perfect candidate for characterizing a multi-fractal time 
series also allowing for the introduction of a metric for 
comparing different time-series. We must note here that 
H = implies a white noise H = 0.5 corresponds 

to a f~ 2 power law process and H = 1 is the signature of 
inverse cubic law process /~ 3 . Processes exhibiting H = 
0.5 and H = 1 are also called as diffusive or Brownian and 
ballistic processes respectively. 

We define the "singularity metric" for two multi-fractal 
time series {X : x\, Xi, X3, • • • > x n ; n G Z, x% £ M} and 
{Y : y u y 2 , y 3 , ...,y„;n £ Z, Vl e R} as p : R x K -> R+, 
where, R + = [0, oo) such that 

p(X,Y) = \ lx - lY \. (6) 

It can be easily verified that this singularity metric p 
satisfies the conditions: p(X,Y) > (non- negativity), 
p(X,Y) = & lx = 1Y, p{X,Y) = p(Y,X) (symmetry) 
and p(X,Z) < p{X,Y) + p{Y,Z) (subadditivity) . Thus, 
for given n time scries, we can construct the matrix p nxn 
as 

Pij = \li-1 3 \- (?) 

In this work, we use the historical closing prices of 580 of 
the approximately 1200 equities listed under 472 compa- 
nies trading on the Johannesburg Stock Exchange (JSE) 
from 1 st January, 1990 to 29 t/l August, 2008 sampled at 
one trading day interval. It is particularly interesting to 
analyze the JSE as, a) it is a developing economy and 
b) since 2001, it has through an agreement allowed cross 
trading with the London Stock Exchange (FTSE) which 
implies that the JSE will possibly be affected by the Euro- 
pean financial weather. The South African economy can 
be classified into five major sectors: natural resources, 
agriculture, manufacturing, services industry and trade & 
investment. Of these, natural resources and the services 
industry are the most dominant sectors. 

The graph partitioning methods of finding fc-clusters in 
large graphs are shown to be unreliable due to the lack 
of a-priori knowledge of the exact number of clusters in 
the graph and thus failure to correctly partition the nodes 
into clusters [55H57] , Hence, multilevel clustering tech- 
niques like agglomerative clustering algorithms and divi- 
sive clustering algorithms are used in order to reveal the 
hierarchical structure of the networks. An agglomerative 
clustering algorithm is a bottoms-up approach where, we 
start off as a set isolated nodes and as edges are added, 
based on their similarity, clusters are formed. It has been 
particularly useful in detecting hierarchical structures in 
large networks from social and biological networks (for ex- 
ample see [381139] ) and in financial networks (for example 



see [10131]). Hence, using the singularity metric p, we de- 
fine the distance between each equity and use the single 
linkage agglomerative hierarchical clustering technique to 
organize the equities into clusters. 

As shown in the dendrogram in fig. [I] the network 
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Fig. 1: Dendrogram representing the hierarchical clustering of 
the equities using the p denned in eq. ((6]). For the sake of 
illustrative convenience, we have showed the dendrogram only 
to the top six branches. Notice the clusters A, B, C, D, E and 
F which contain 51.73%, 21.97%, 12.98%, 3.81%, 4.94% and 
8.65% of the total leaves respectively. 

breaks down into six dominant clusters denoted by A, B, 
C, D, E and F; each containing 51.73%, 21.97%, 12.98%, 
3.81%, 4.94% and 8.65% of the total leaves respectively. 
We must note here that we have shown only the top six 
branches for the sake of brevity. This implies the presence 
of a super-cluster A that contains over half of the nodes. 
However, to validate this finding, let us look towards find- 
ing the graph properties of the network. 

Looking at the distribution of p, as seen from fig. [2[ 
we can see that the f3 shows a multi-modal density distri- 
bution. Hence, can collect the nodes falling under a spe- 
cific range such that Ae [min(p), 1.72), Be [1.72,3.36), 
Ce [3.36,4.77), De [4.77,5.45), EG [5.45,6.08) and Fe 
[6.08,max(p)]. This provides us information about the 
percentage of nodes falling under a specific category. We 
have the following distribution of nodes in each of the six 
categories from A-F: A=51.90%, B=21.97%, C=13.15%, 
D=5.88%, E=4.67% and F=2.42%. We can see that 
51.90% of the nodes are very close to each other, while 
2.42% are the farthest from each other. This is also con- 
sistent with our findings from the hierarchical clustering. 
The three biggest clusters have a comparable number of 
nodes. 

Given a undirected graph Q = (V, £) with no self loops, 
where V is the set of all vertices or nodes and £ is the 
set of all edges in the graph Q, it is represented by its 
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Fig. 2: The probability distribution P(pij) of the singularity 
metric elements pij. The multi-modal distribution is evident 
which allows us to make segmentation of the distribution func- 
tion into six distinct groups and perform a network analysis 
with them. 



adjacency matrix A such that Aij = 1 if the nodes i and j 
are connected and otherwise. Also An is zero since the 
graph has no self loops. The average degree of the graph 
Q is defined as (k) = 1/(N(N - 1)) £ Aij 02]. The 

characteristic path length which is the mean of all the 
geodesies between the nodes is given as L = 1/(N(N — 
1)) dij where, G V is the geodesies between the 

nodes i and j and T> is the distance matrix. The graph 
efficiency is calculated as E = 1/(N(N — 1)) l/d%j- 

The average clustering coefficient for a graph is given by 



(C) 



N ^ 



2{|e 3fc }l 
ki(ki 1) 



,v k G Vi,e jk G £. (8) 



and the vertex centrality, which is a score of how impor- 
tant is a node in a given topological configuration for the 
node i is given by 



n jk 



(9) 



where, n^-fc is the number of geodesies between the nodes 
j and k and njk(i) are the number of geodesies between j 
and fc passing through i. The average vertex centrality is 
given by B = Y^Bi- We must note that these quantities 

i 

are also dependent on the topology of the graph under 
consideration. The computation of these quantities was 
done using the MATLAB BGL software provided in 03]. 

We define a threshold parameter £ G [min(/3),max(/3)] 
such that, we can calculate the adjacency matrix A(£) = 
Aij(£) = 1 when < £ and zero otherwise. For example, 



Fig. 3: The adjacency matrix at -Ay(£*), the critical value of 
£ = 0.96. Notice the presence of overlapping communities in 
the region corresponding to pij > 4.47. Also, we can see that 
the communities only interact with their neighbors. 



we have plotted the adjacency matrix formed for £ = 0.96 
in fig. [3J We get the corresponding quantities average de- 
gree, characteristic path length, graph efficiency and av- 
erage clustering coefficient (&(£)), E(£) and (C(£)) 
as functions of the threshold parameter £ respectively. We 
have plotted some of these quantities as a function of the 
threshold parameter £ in fig. @]ibr A(£). 
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Fig. 4: Graph Efficiency (E(£)), clustering coefficient (C(£)) 
and average graph degree (fc(£)) plotted as a function of £ for 
the network A{£). Notice that at = 0.96, the clustering co- 
efficient is 0.4662 and falls off in the region [0.96, 2] to rise again 
with a gentler slope. Correspondingly, in the same region, the 
average graph degree rises with a very small slope. However, 
there is no change in the slope of the graph efficiency. At £*, 
(£(£*)) and (fc(r)> are 0.2685 and 0.1797 respectively. 



p-4 



Singularity strength based characterization of financial networks 



We can observe from the definitions of the graph prop- 
erties that more nodes are added to the adjacency matrix 
as we go over the range of £; and hence (&(£)), (C(£)) 
and (E(£)) should be increasing functions of £. However, 
from fig. H we observe that the average clustering coef- 
ficient (C(£)) decreases in the interval [0.96,2] and then 
rises again. Correspondingly, the average degree 
also shows a change of slope in this region. It is interest- 
ing to note that the quantity (&(£)) changes very slowly 
in this region, while the graph efficiency (E(£)) does not 
show much deviation from the expected monotonic ris- 
ing behavior. This is an interesting finding since it im- 
plies that though the degree of the graph does not change 
appreciably, the graph undergoes substantial "re-wiring" 
in this regime which indicates dynamical restructuring of 
the relationship between the various equities when in cer- 
tain phases of their mult i-fr act ality. This could imply 
the occurrence of a structural phase-transition since the 
(C(£)) increases rapidly at low £, reaches a peak value at 
£ = 0.96 = £* and falls off to rise again but with a gentler 
slope. This value of £* can be considered as the critical 
value for the network for which (C(£*)) = 0.4662. The 
relationship of high clustering coefficient and modularity 
has recently been discussed in |44j . 
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Fig. 6: Hurst exponent H = h(r = 2), plotted for the 578 equi- 
ties analyzed in this work. The mean (H), represented by the 
red line is at 0.4701 and the standard deviation is represented 
by the blue broken lines at 0.4701 ± 0.0840 respectively. We 
can see that the some of the equities deviate significantly from 
the mean and the equity Kruger Rand - Half at x — 296 (on 
the z-axis has) #(296) = 0. 
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Fig. 5: The unnormalized average betweeness centrality (£>(£)) 
plotted as a function of the cut-of threshold £. Note that 
though at £*, there is a local maxima, the global maxima lies 
at £ = 0.5366, indicating an optimal network configuration at 
this value. 

In fig. [5J we have plotted the unnormalized (£?(£)) as 
a function of £. It can be seen that the (£?(£)} reaches 
a peak value of 926.19 at £ = 0.5366, while at £* it is 
significantly lower at 522.8. However, it is also evident 
that there is a local maxima of (B(£)) at £*. This could 
possibly mean that the most optimal configuration of the 
network is at £ = 0.5633 which implies a narrow band 
of the singularity strength where the equities differ from 



each other by p ~ 0.5633. This also corresponds to the 
maxima of the distribution function followed by ft as is 
evident from fig. [3] 

Figure [5] represents the Hurst exponent calculated at 
h(r=2) for the different equities trading on the JSE. We 
observe that the mean Hurst exponent for the equities 
is H = 0.4701 ± 0.0840 which indicates near-diffusive 
behavior. It can also be seen that 90 equities deviate 
significantly from the trend. However, only one equity 
namely Kruger Rand - Half (ticker symbol KRHT) shows 
H = 0. Amongst the deviating equities metal, mining, fi- 
nancial institutions services sector and holding companies 
are the major contributors, notably the platinum min- 
ing companies like Anglo American Platinum, Aquarius 
Platinum, Eastern Platinum, Impala Platinum and High- 
veld Steel & Vanadium. Some other notable companies 
in this range are the AECI, Banro Corp., British Amer- 
ican Tobacco, ABSA Bank, Alcoa Steel, ArcclorMittal, 
BHP Billiton, Goldone Internatinal, First Rand Limited 
(parent company of the First National Bank), Good Hope 
Diamond Mines, the Anglo American Group and Amalga- 
mated Breweries Ltd., a subsidiary of the SABMillcr Pic. 
which is one of the largest breweries in the world. 

In conclusion, in this work, we have explored financial 
networks from a multi-fractal analysis point of view. The 
resulting study throws light on the fact that there exists 
a hierarchical community structure in the South African 
financial market based on the multi-fractality of the in- 
dividual equities. The fact that the singularity strength 
can be used to find such a clustering can be instrumental 
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in applying this technique to other kinds of time series. 
Also, we were able to show the presence of one super- 
cluster, through both agglomerative hierarchical cluster- 
ing and forming a graph based on the singularity metric. 
We also showed the presence of a narrow band-width dif- 
ference of Holder exponent for which the network shows 
maximum vertex centrality. The identification of a critical 
threshold parameter value £* which is almost double the 
value of £ for a optimum network configuration is indica- 
tive of a dynamic topological changes in the network in 
a particular regime of multi-fractality. We have also ex- 
plored the existence of a collective sub-diffusive motion of 
the equities determined by the mean Hurst parameter of 
0.4701±0.0840. This also indicates the presence of a power 
law behavior of the fluctuations. However, the deviation 
of a select few high capital industries from the average 
behavior is interesting which could possibly be due to tur- 
bulent geo-socio-political factors in the country in the last 
few decades and which would have to be checked with re- 
spect to the other economies and the socio-political factors 
modeled in conjunction with wider and more sophisticated 
studies. 
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